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ABSTRACT 

Because speech sound discrimination tests can be 
contaminated by a particular kind of ’’response set,” i.e., the 
tendency not to respond to the final parts of words as effective 
stimuli, children can be mistakenly diagnosed as having very poor 
speech-sound discrimination. About 20 percent of urban lower-class 
children have this response set difficulty and could benefit from an 
appropriate remedial program. This project has been aimed at 
developing a group- administered speech-sound test, appropriate for 
administration to young lower-class black urban children. Focus of 
the study has been more on identification of response set patterns 
rather than the evaluative criteria: subjective reaction of the 
testers to administration of various formats, general performance of 
children, ability of a particular format to discriminate between 
initial and final phoneme performance, and reliability measures. 
Detailed item analyses have been made in order to locate problem 
areas of discrimination. Instructions for administering the 
speech-sound discrimination test and illustrations thereof, and 
tables of test results are appended. [Not available in hard copy due 
to marginal legibility of the original document. ] (KG) 



ABSTRACT 



A recent study has shov/n that speech-sound discrimination 
tests can be contaminated by a particular kind of "response set", 
i.e., the tendency not to respond to the final parts of V7ords as 
effective stimuli. As a result of this contamination, children 
can be mistakenly diagnosed as having very poor speech-sound dis- 
crimination. Because speech-sound discrimination is viewed as a^ 
developmental phenomena, often remedial training is put off until 
the child is about eight years of age; and then, the remedial 
training may not focus on the child’s real difficulty— the response 
set. It is estimated that about 20% of the population of urban 
disadvantaged children have this response set difficulty and could 
probably benefit by an appropriate remedial program. 

The difficultv with developing appropriate training procedures 
is that the only available test that is constructed for (though not 
scored for) distinguishing between speech-sound discrimination dif- 
ficulties and response set problems is an individually administered 
test. Any reasonably designed study v/hich would deal witii training 
procedures for children shov/ing evidence of response sets v7ould 
reguire the pretesting of about four hundred children. This is 
obviously a very costly procedure.. 

This project has been devoted to the development of a group- 
administered speech-sound test, appropriate for administration to 
young, lower-class, urban black children. We have been more con- 
cerned with the identification of the response set pattern than 
the evaluative criteria: (a) subjective reaction of the testers to 
administration of the various formats; (b) general performance of 
the children; (c) ability of a particular format to discriminate 
between initial and final phoneme performance; and (d) reliability 
measures. We also made some detailed item analyses in order to 
locate problem areas of discrimination in this particular population 

Through pilot— testing a' variety of formats, it was evident that 
although speech-sound discrimination in these young disadvantaged 
subjects could be evaluated by group procedures, much of what is 
viev;.ed as speech-sound discriirrination ^dif ficulties in test proto- 
cols, may be in reality, confusion over instructions. In addition, 
speech-sound discrimination of initial consonant sounds is quite 
different from speech-sound discrimination of final consonant sounds 
The relationship between the difficulty levels of the various _ 
phonemic discriminations in the two positions is either negative 
or zero, thus implying the need for separate measures and differ- 
ential remedial procedures. 



BRIEF DESCRIPTION OF THE PROJECT 

# 

Studies concerned v/ith speech-sound discrimination (SSD) (a 
specific type of auditory ability thought to be highly related to 
speech and reading skills) generally have produced conflicting 
results. The inconsistencies found in these studies have been in- 
terpreted by Russell and Fea (1963) and Coller (1967) to be due, 
in part, to the use of measuring instruments which evaluate (more 
or less adequateli^ different, and often seemingly incompatible 
aspects of SSD. In addition, the results of a recent study by 
Coller, Coleman, and Schwartz (1967) show that the scoring pro- 
cedure used for diagnosing SSD ability in one of the more popular 
SSD tests is contaminated by "response set®* behaviors. Coller, et 
al. , claim that the Wepman Auditory Discrimination Test (1958), a 
bi-positionally balanced, paired speech-sound test, is not, under 
its present scoring system, a valid measure of speech-sound dis- 
crimination. The suggested scoring system of the Wepman Auditory 
Discrimination Test (WADT) sometimes permits children who have only 
"response set" problems, i.e., the tendency not to treat the final 
parts of words as effective stimuli, to be mistakenly diagnosed as 
having yery PQQr, jSSp ability . It should be clear that both the in- 
accurate diagnosis of a child’s SSD ability and the use of inappro- 
priate scoring systems can most certainly help to create inconsis- 
tencies when. SSD ability is related to speech and reading skills. 

It is easy to see that a lack of understanding of what speech- 
sound discrimination tests are actually measuring can cause gross 



errors in the diagnosis of the child's speech-sound discriraination 
ability. What is not too fully realized is that such errors can be 
compounded v;hen remediation procedures are suggested. Either in- 
efficient, inappropriate, or no remedial programs are typically 
advised. For example, Wepman (1960) has suggested that, "...speech 
correction for children whose auditory discrimination develops 
slowly should not be started until after the seventh year Jp. 329J.*' 
Accordingly, if a child showing evidence of response set behavior 
is mistakenly diagnosed as having poor SSD ability, remedial train- 
ing might be put off and hot started until it is too late. 

It is estimated that about 20% of Wepman test protocols of 
the disadvantaged population are mistakenly diagnosed as indicating 
that the children have poor speech-sound discrimination ability. 
Coller, et £l«# claim that the majority of such children really 
have a response set problem, that is, attending to initial sounds, 
but not attending to the same sound v;hen located in a final position. 
This indicates a need for a test which can be administered to groups 
of children so that an accurate estimate of the child's SSD ability 
can be quickly and easily obtained. Such a test should also be able 
to determine if the child has a response set problem. 

This project has been devoted to the development of a group- 
administered speech-sound test, appropriate for administration to 
young, lov/er-class, urban black children. We have been more con- 
cerned with the identification of the response set pattern than the 
evaluation of overall speech-sound discrimination although this 
test could provide a measure of this ability. We, therefore, developed 
a test which could give us separate measures for the discrimination 



of the same phonemes in initial and final positions of minimal 
phonemic difference pairs. 

The year's v/ork involved the selection and preparation of 
items and the investigation of appropriate administration pro- 
cedures, We v/ere particularly concerned in regard to two problems; 

(1) that neither the format of the test nor the instructions should 
be a barrier to obtaining a reliable estimate of the child's ability 
to discriminate particular phonemes; (2) that the content of the 
items not be unduly unfamiliar for children of this age and back- 
ground, again, so that the measure attained should reflect speech- 
sound discrimination ability rather than some other variable such 
as the extent of the child's vocabulary. 

We consequently pilot-tested a variety of formats using several 
evaluative criteria: (a) subjective reaction of the testers to ad- 
ministration of the various formats; (b) general performance of the 
children; (c) ability of a particular forraat to discriminate be- 
tween initial and final phoneme performance; and (d) reliability 
measures. We also made some detailed item analyses in order to 
locate problem areas of discrimination in this particular popula- 
tion. 



METHODOLOGY 



Work on this project proceeded through several stages. The 
selection of items and their illustration; preparation of varia- 
tions of the same test, differing only in format or administration 
procedures; administration of the various formats to pilot samples; 
examination of the results with subsequent selection of three for- 
mats for further investigation; administration of the selected 
forms to new samples of similar subjects and analysis of the data 
obtained on the three selected formats. 

Because of our great concern to find formats and administra- 
tion procedures appropriate for use with young, urban, disadvantaged 
children, we developed five versions of the group administered 
speech-sound test. In addition, for the same reason, great care 
was exercised in the selection of items and their illustrations. 

Each test item required that a minimal phonemic discrimination 
be made. For Formats 1 and 2, discrimination had to be made be- 
tV7een illustrations of two v;ords, e,g,, a picture of a gate and a 
picture of a date. In Format 1, the child had to mark the picture 
that contained the sound voiced by the tester whereas in Format 2, 
the child had to mark the picture of the v;ord said by the examiner. 
For the three other formats (3, 4 and 5), discrimination had to be 
made between pairs of stimuli, e,g,, gate/gate and gate/date. Of 
these three formats, the first two contained literal representa- 
tions of the words, whereas Format 5 merely presented two symbols 
which were the same (two circles) versus tv;o symbols which differed 



(a circle and a square)* In Format 3, the child was told to mark 
the words said by the examiner* In Format 4, there was an addition- 
al instruction advising the child to mark the pair that was the 
same or different according to whether the words said by the examiner 
were the same or different. These last instructions were also used 
with the pictures of symbols. These latter three formats were 
developed in the hope that the additional clue regarding the same- 
ness or difference V70uld counteract any lack of familiarity with 
the stimuli. If use of the symbols were understood by the children, 
this would have vastly extended the possible range of test items. 

Since the response set phenomenon (correct response to a par- 
ticular phoneme in the initial position but incorrect response to 
the same phoneme in the final position) was obtained with the in- 
dividually administered V7epman Auditory. Discrimination Test , the 
phonemes used in each word-pair were derived from the VJepman List* 
The Wepman List includes 13 paired phonemes; (1) three voiced stops 
(g/b, d/b, g/d) ; (2) three unvoiced stops (k/p, t/p, k/t) ; (3) one 
voiced fricative (e-) j (4) five unvoiced fricatives (f/p# f/s, s/e-, 
s/sh, -e/sh); and (5) one nasal (m/n) * Our test contains eleven 
paired phonemes. Three paired phonemes were eliminated because it 
was impossible to find words to fit them which could be pictorially 
represented and recognizable by children* An r/1 paired phoneme 
was added to our list, as this pair seemed to be one of common 
difficulty in discrimination for this population. 

Each of the eleven phoneme— pairs appeared both at the begin- 
ning of a word-pair and at the end of a word-pair (e.g. , cot/gpt 
and cake/ca£e) . Altogether there are twenty-tv;o different word- 



pairs. For these twenty-tv7o items, in the single item discrimina- 
tion tests, the correct phoneme was found only in one of the two 
presented items. In the paired item discrimination tests, the cor- 
rect response was the pair of different stimuli. In addition, for 
the latter tests there v?ere also eleven items in which the correct 
response was the word-pair that v;as the "same”. These additional 
eleven items were randomly chosen from the v;ords which were in- 
cluded in the "different" items. For the additional eleven single 
discrimination items, the phoneme not selected in the original item 
was the correct choice. Position of the correct response was ran- 
domly assigned throughout each test. The items are presented in 
Tables 6, 7 and 8 in the Findings Section where they are discussed 
with reference to the difficulty level of the various phonemic dis- 
criminations . 

A total of 84 Black and Puerto Rican subjects judged to be 
lower SES on the basis of attendance in a selected school received 
either two or three of the five varying test' formats. The tests 
were administered in reading classes which met for forty minutes 
a day. Once time for the children to arrive and settle down and 
for the testers to set up was taken into account, it was found that 
only one test could be administered comfortably in one period. 
Therefore, each child was tested in two or three sessions. As a 
result, control over order of presentation, and assignment of 
subjects of egual ability to the various test formats was not 
feasible. Further, because of absences, different numbers of child- 
ren were administered the various test formats. 

Because of the assignment of reading groups organized by the 
school for testing, the size of the groups varied considerably. 
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The examiners found that because of the organization of the test 
booklet, it v;as fairly easy for the children to follow and complete 
the test. The largest group tested consisted of 13 subjects. There 
was no difficulty with a group of this size. The examiners estimated 
that administration of this test by two examiners tc ten to fifteen 
children (half a class) is very comfortable. V7e, therefore, eliminated 
the phase of this study v/hich was to be concerned with determining 
the relative size of the group to which such a test could be suc- 
cessfully administered. 

Each child was presented a booklet containing thirty-five 
pages with one test item, on each page. There were two sample items 
which were drawn on the blackboard as well as appearing in the book- 
let, and which were administered directly by the tester. The pur- 
pose of the sample items was to orient the child to the presentation 

\ 

format of the booklet and to make sure he understood the instruc- 
tions. To avoid distraction, each test item appeared on a separate 
page. To aid the subject in keeping his place in the test booklet, 
a picture such as a ball or a flov/er (in the manner of the Lorge 
Thorndike Intelligence Test — Level 1) v;as placed on the upper right- 
hand corner of each page. The children were instructed to raise 
hands if the proper picture did not appear on the page, so that a 
tester could be of assistance. During testing, there v/ere always 
at least two testers present, one to read the items and one to 
circulate around the room to aid those subjects who were having 
difficulty for one reason or another. 

In order to evaluate the various formats as to their appro- 
priateness for the purpose for which they had been designed, we 
considered both the testers* subjective reactions to administering 



o 

ERIC 







the various formats as well as more formal statistical character- 
istics of group results. 

Development of a test of this type requires a some .'hat unortho- 
dox approach to evaluation of test characteristics. Essentially, the 
test is to be used for diagnostic purposes, that is, to detect those 
subjects who have a particular response set in auditory discrim.ina— 
tion. Such an instrument, which measures perceptual discrimination, 
is not subject to the same degree of developmental influence that 
an instrument measuring a learning or cognitive skill would be. 
Phonemes are discrete rather than continuous qualities. One is 
limited, therefore, in trying to manipulate the stimuli to increase 
similarity betv/een the different sounds in order to make discrimi- 
nation more difficult. 

Therefore, the test should not be so designed as to obtain 
maximum variance among scores being administered to a group of sub- 
jects. Pvather, most children should be able to obtain a high if not 
perfect score on this test. Further, because of the decreased vari- 
ance in such a test, internal consistency measures of reliability 
may be misleading. 

In the actual evaluation of the test formats, we generally 
took the following position. High error scores were interpreted as 
representing poor administration instructions— not inability of the 
subjects to discriminate and not improved test discrimination. Be- 
cause of the specific diagnostic purpose of this test, we looked 
most favorably at those formats which discriminated V7ell between 
initial and final phoneme performance. At this point in our evalu- 
ation, the internal consistency measures of reliability had little 




influence on our approval or disapproval of a particular format. 

Format 1 was discarded because of both quantitative and quali- 
tative objections e The testers felt that the children found the task 
confusing. In addition, some teachers commented that this was an 
inappropriate testing procedure since the instructions required 
behavior contrary to approved teaching procedures for reading. The 
fact that this test had the second highest mean error scores, 3,58 
for initial phonemes and 5.15 for the final phonemes (Table 1), 
seems to confirm the test administrators* judgment of the inade- 
quacy of this format. 

Format 2 was most liked by the test administrators. Although 
the Kuder-Richardson reliabilities coefficients (Table 2) v/ere not 
satisfactory, the mean error scores were lov; and there V7as good 
discrimination between initial and final phoneme performance. We, 
therefore, decided to obtain additional data on this format. 

Format 3 was also chosen by the examiners as being very appro- 
priate for the sample of children tested. In the case of this vari- 
ation, the Kuder- Richardson reliabilities (Table 2) are very good, 
but the mean error scores are relatively high and discrimination 
between initial and final phonemes performance is relatively minor 
(Table 1) . In the case of this format also, we decided to gather 
additional data. 

Format 4 was disliked by the testers. The addition of the same- 
different instructions seemed to be redundant and confusing and 
probably not attended to by the subjects. Hov/ever, despite the low 
reliability coefficients, the mean error scores were the lowest and 
the discrimination between initial and final phoneme performance 



was as good or better than the discrimination obtained v;ith any 
of the other formats (Table 1), V7e, therefore, decided to include 
this variation for further investigation despite the administrators* 
strong dislike of it. 

Format 5 was disliked by the examiners. The testers felt that 
the children v/ere unable to understand the instructions. The high 
mean error scores and poor discrimination bctv;een initial and final 
phoneme performance seemed to bear out the examiners* judgment 
(Table 1) , 

In summary, we discarded Formats 1 and 5, and retained For- 
mats 2, 3 and 4 for further examination. 

At the conclusion of the initial pilot-testing, the investi- 
gators were not certain that the order of presentation of tests to 
the same children and the relative ability of children assigned to 
given formats had not confounded the results. To control for these 
factors, each of the three formats selected for further investiga- 
tion was administered to an independent sample of lov 7 SES, blacky 
first-grade subjects. Seventy subjects were included in each sample. 
Each sample was tested twice v;ith one week interval between test- 
ing in order to obtain test- retest reliability data. Because of 
absences and invalid protocols, the size of the final samples were 
57 for Format 2, 52 for Format 3, and 44 for Format 4, This time 
the formats were put on tape and played to each group of subjects. 

This procedure insured more uniform administration of the test. How- 
ever, as in the first pilot- testing, the sample items were administered 
by the examiner. The subjects v;ere assigned to the different for- 
mats by randomly dividing each first-grade class in half. Each 
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half of any class (ten to fifteen children) received a different 
format. The format selection was determined beforehand in such a 
manner that each combination of tv;o formats occurred V 7 ith equal 
frequency. By dividing the classes in this v/ay, chances of obtain- 
ing essentially equal samples across the three treatments with 
respect to speech-sound discrimination ability were improved. 

The data obtained from this second sample v;as then more 
thoroughly examined. In addition to the calculation of means and 
standard deviations for the initial and final phoneme subtosts of 
the three formats (Table ’3), test-retest (Table 4), and Kuder- 
Richardson reliability coefficients (Table 5) were calculated. 

Item analysis to determine the difficulty level of the items over 
the three formats was completed (Tables 6, 7 and 8) . Consistency 
of item difficulty from initial to final position within each for- 
mat as v;ell as consistency of item difficulty across formats v/as 
estimated by rank order correlation coefficients (Tables 9, 10 and 
11 ). 
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FINDINGS 

Reported in this section are the results of the second pilot 
testing with the three selected formats (2, 3 and 4), 

From Table 3, containing Ns, means and standard deviations 
for the initial and final phoneme subtests for all three formats, 
it can be seen that maximum discrimination between initial and 
final phoneme performance was obtained v.^ith Format 3, both for 
the initial and repeated administrations of the test. Format 3 had 
the highest reliabilities coefficient associated v;ith it — both the 
preferred test- retest reliability coefficient, r =,76 (Table 4); 
and the Kuder-Richardson internal consistency reliability measure, 
r =.69 (Table 5). These findings, combined with the examiners* sub- 
jective evaluations, have led us to select Format 3 as appropriate 
for administration to young, disadvantaged children. 

However, the results are somev;hat less than ideally satisfac- 
tory; the generally high means indicate that the subjects v;ere 
hitting the ceiling of the test. The low reliability coefficients 
reflect the lack of variance and ceiling effect, As discussed in 
the Methodology Section, the cause of this may be, to some extent, 
inherent in the nature of the test. Making a phonemic discrimina- 
tion is an all-or-none matter, probably achieved at relatively 
young ages. Nevertheless, the data from all three formats was 
examined in greater detail to gain more understanding of v/hat had 
occurred. 

Essentially, the final phoneme subtest carried the entire test. 
From Tables 4 and 5, it can be seen that the reliability coefficients 
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obtained v/ith the final phoneme subtest are almost equal in value 
to those obtained v;ith the entire test. 

Tables 6, 7 and 8 present the phonemic discriminations required 
of the subjects and the percentage of subjects making correct re- 
sponses for each discrimination as it appeared in the initial and 
final positions for the three formats (2, 3 and 4, respectively). 

In Table 9, the percentage correct for each discrimination in 
both positions has been averaged across test administrations and 
repetitions of the discriminations, and then ranked. 

From the correlations reported in Table 10, it can be seen 
that the rank positions of both the initial and final phonemic 
discriminations remain fairly constant from one test to another. 

Examination of Table 11, coefficients of the rank difficulty 
betv/een phonemic discriminations in initial and final positions 
across test-retest administrations of the three formats, reveals 
consistent substantial positive correlations for both the initial 
and final phoneme subtests. Hov/ever, v?ithin any given administra- 
tion the correlation between the initial and final phoneme sub- 
tests is either negative or negligible. This explains the lack of 
improvement in total test reliability which one would expect from 
the increased number of items involved. V7e seem to have two quite 
different tests,, one considerably more difficult than the other. 

Although av;are that generally none of the discriminations were 
very difficult for the children to make, we, nevertheless, examined 
Tables 6 through 9 to determine which phor 2 mic sounds v;ere respon- 
sible for most of the variance of the test. Although their ranks 
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within the first five posit5.ons across the formats do change, con- 
sistently ^ the five most difficult initial discriminations are: 
g/d, ch/sh, m/n, sh/s and s/f. One pair, p/c, v;as found difficult 
in Format 2, but easy in Formats 3 and 4, In final position, the 
five most difficult sounds arejp/c, d/b, g/c, sh/s and g/b. Tvro 
inconsistencies among formats occurred: t/p v/as second in difficulty 
in Format 3; and r/1 v/as second in Format 4, No explanation is 
found for these inconsistencies. It appears that g/d and sh/s are 
. among the most difficult in both initial and final positions. 

Since there is only one item per phonemic discrimination in 
each position, it is difficult to determine from the tests ;7hether 
or not some difficulty may be related to the phonemic context, that 
is, the possible differences in vov^el sounds across varying stimuli, 
or even, the general familiarity of the items. Specific reactions 
to particular stimuli may also account for variations in difficulty 
level,- The examiners believe that the difficulty of the s/f dis- 
crimination may have been due to the children's amused reaction to 
the illustration of "feet," v;ith consequent lack of attention to 
what the examiner (or tape) v/as saying. The exairdners further be- 
lieve that the difficulty associated V7ith the g/d discrimination 
in the initial position may be related to its being the first item 
of the test. The rhyming nature of the test items, which is not so 
for the sample items, may at first confuse the subjects. 
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cowcLusions 



It is possible to develop a QToup~adininistered speech-sound 
discrimination test appropriate for administration to youn^ dis- 
advantaged children, (Such a test should be adrf:ini stereo by tv;o 
testers to groups ranging up to approximately 15 in nurrher. ) This 
test can differentiate between the child’s ability to distinguish 
the same sounas in iiiitiai and final positions, vzhich is inipoirtcint 
information for remedial, procedures. 

From our work with various formats, it v/ould appear tliat much 
of what- is viewed as speech-sound discrimination difficulties in 
test protocols may be, in reality, confusion over instructions. 

When the instructions become clearer, the group performs at a high- 
er level. 

Judging by data from previous work at the Institute v/ith 
similar samples, the test described above, despite beina croUD- 
administered, is easier than the Wepman Aud itory Discrimination 
Test , an individually administered instrument. One possible explana- 
tion is that, by discarding the same-different instructions, a con- 
founding factor to the raeasurement of children’s speech-sound dis- 
crimination ability may have been eliminated. Evidence supporting 
this contention can be seen in the results w'ith Format 4, which V 7 as 
identical to Format 3 except for the inclusion in Format 4 of same- 
different instructions. Children taking Format 3 obtained higher 
scores than those taking Format 4, 

From our data it seems that, at least for the samples we used, 
speech-sound discrimination in initial consonant sounds is quite 
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different from speech-sound discrimination in final consonant 
sounds. Different j^honemic discrirainations are difficult as a 
function of position in the v/ord. This has great implication for 
remedial techniques, as well as test construction of speech- sound 
discrimination tests. Different phonemes should be focussed upon, 
depending upon the context in v/liich they appear. To achieve highly 
reliable measures of this ability, it is probable that separate 
tests or subtes.ts should be administered for discrimination within 
each position. One cannot assume that adding another item V7ith a 
phonemic discrimination in another position is equivalent to in- 



creasing the number of items of similar nature. 

The test developed in this project reflects this problem. To 
improve the general reliability of this test it would be essential 
to add more items with the Scime phonemic discriminations to each 
subtest. Although this test was patterned after the V7epman, other 
sources of information regarding phonemic discrimination problem 
areas for this population (e.g., descriptive linguistic stuaies) 
drav/n on as guides for additional items which v/ould in- 
crease the diagnostic utility of the test. 
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TABLE 1 



Ns, Hean Error ^ Initial and Final Phoneme Scores 
and Standard Deviations for Five Variations cf a 
Group Speech Sound Discrimination Test 

Initial Final 



ForTnat 


N 


X 


S.D. 


X 


S.D. 


1 


48 


3.58 


2.98 


5.15 


3.23 


2 


29 


1.10 


1.26 


2.83 


1.90 


3 


41 


2.37 


3.22 


2.80 


2.93 


4 


24 


.75 


.94 


1.88 


1.35 


5 


35 


4.94 


3.64 


5.29 


3.07 
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TABLE 2 



Kuder-Richardson #20 Reliability Coefficients for 
Initial and Final Phoneme Subtests of Three 
Variations of a Group Speech Sound 
Discrimination Test 



Format. 


N 


Initial 


Final 


2 


29 


.65 


.69 


3 


41 


.93 


.88 


4 


24 


.53 


.55 
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TABLE 3 



. Means and Standard Deviations for Initial and Final 
Phoneme Subtest Scores and Total Test Scores for 
Three Formats of a Group Administrated 
Speech Sound Test 



Initial 



Format 


N 


X 


S.D. 


2 


57 


15.89 


1.02 


3 


52 


15.38 


1.42 


4 


44 


13.95 


2.22 



2 


57 


15.56 


1.28 


3 


52 


15.60 


1.60 


4 


44 


14.86 


1.50 



Final 




Total 


Test 


*««• 

X 


S.O. 


X 


S.D. 


•w* 








Test 




* 




13.28 


1.86 


29.18 


2.40 


11.75 


2.61 


27.10 


3.23 


10.98 


2.51 


24.93 


5.09 



Retest 








13.47 


1.96 


29.04 


2.82 


12.94 


2.28 


28.54 


3.27 


13.79 


1.91 


28.34 


2.90 



Maximura Possible Scores 

Initial Phonemes (17) 
Final Phonemes (16) 
Total Test (33) 







TABI.E 4 



Pearson Product Moment Test-Retest Reliability 
Coefficients for Three Formats of a Group 
Administered Speech Sound Test 



Format 


N 


Initial 

Phonemes 


Final 

Phonemes 


Total 

Test 


2 


57 


• 

to 

o 


.50 


.50 


3 


52 


• 

to 


.69 


.76 


4 


44 


.10 


.44 


.40 



er|c 




[ TABLE 5 



Kuder- Richardson #20 Reliability Coefficients for 
Two Administrations of Three Formats of a 
Group Administered Speech Sound Test 



First Administration: Second Administration: 



Format 


N 


Initial 


Final 


Total 

Test 


Initial 


Final 


Total 

Test 


2 


57 


.13 


.51 


.56 


,22 


^57 


,64 


3 


52 


.32 


.61 


.61 


,53 


.59 


.69 


4 


44 


.59 


.54 


,83 


.29 


,53 


.61 
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TABLE 9 



• Rank Order Difficulty Level of Phonemic Discriminations 
(Averaged Across Test and Retest Administrations) 
m Xnrtxal and Final Positions for Three Forifiats 



Rank Difficulty Order 



Phonemic 

Discrimination 



Initial Position 
Formats 

2 3 4 



Final Position 
Formats 
2 3 4 



g/d 


1 


5 


3 


3 


3 


4 


t/c 


8 


6 


6 


8 


9 


8 


ch/sh 


2 


4 


1 


11 


8 


10 


g/b 


11 


9.5 


11 . 


5 


4 


5 


p/c 


4 


7.5 


10 


1 


1 


1 


n/m 


3 


3 


4 


7 


11 


11 


sh/s 


5 


2 


5 


4 . 


5 


3 


t/p 


9 


9.5 


8.5 


9.5 


2 


7 


d/b 


7 


7.5 


8.5 


2 


6 


6 


s/f 


10 


1 


2 


6 


10 


9 


r/1 


6 


11 


7 


9.5 


7 


2 



o 

ERIC 



i 
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TABLE 10 



Spearman-Brov/n Rank Order Correlation Coefficient^ 
Between Difficulty Levels of Phoneme Discriminations in 
Initial and Final Positions Among Three Formats 



Formats 

Format 3 4 





initial 


FUfTil 


Initial 


Final 


2 


.30 


.44 


• 

CO 


.52 


3 




— 


.77 


.71 
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TABLE 11 



Spearman- Brov;n Rank Order Correlation Coefficients 
Betv^een Difficulty Levels of Initial and Final Phonemic 
Discriminations Across Test-Retest Administration of Three Formats 
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Format 



Rank Order Correlation Coefficients 



2 

3 

4 



I 1st Ad m X I 2nd Adm 
.67 
.24 
.69 



F 1st Adm X F 2nd Adm 
.89 
*. 71 
.39 



2 

3 

4 



I 1st Adm X F 1st Adm 

-.06 

-,20 

-.63 



I 2nd Adm x F 2nd Adm 
.13 
-.83 
.01 



^AV ^ ^AV 



2 

3 

4 



.08 

-.51 

-.54 
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SPKECH-SOUnD DISCRIMIi-IATXOIl TEST 
Administration Diagrain 



FOitMAT 3 



P = Procedure 



Language 



(P) First, seat the children far enough apart so tliat^thcy are 
not able to copy easily from each other. V/e have tested up to 15 
at a time, but you raay do more if you have space and they can all 
hear the tape recorder. Hake sure they all have a. pencil or crayon. 
Hand out the bcoldets and have them each v/rite their ^ first and last 
name on the front, helping those v/ho can’t v/rite their names, 

(L) "WE ARE GOING TO PLAY A G7dIE VaTH FIXTURES . FIRvST, WE WILL 
HAND OUT THESE BOOKLETS, AND THEN WE WILL TELL YOU IIOVJ TO PLAY, 



Testing; 



1, (P) Dravj two boxes on the blackboard, the 
ing a star and a box, the second, a box and a 
page i in booklet.) 

2. (L) "OPEN YOUR BOOKS TO THE FIRST P/XE. I 
LIKE THIS." (pointing to the board) 



first contain- 
hox. (Same as 

T SHOULD LOOK 



3, (L) "FIRST IS k PICTU1U3 OF A STAR AND A BOX." (point to 
first picture) ,' "NEXT IS A PICTURE OF A BOX AND A bOX." (point 
to second picture) "PUT 7iN 'X* ON THE ONE I SAY, BOX , BDX, 

(P) 2Tow, pick a cliild to come up and show you which^pne 
is correct. Then ask if everyone put an X on that one. They 
v/ill be confused about \vhat kind of X to make, so at thxs 
point, tell them they should "MAKE BIG X LIKE THIS" ; and • 
you make a big X covering the entire Box containing box, cox, 

4, (P) Erase this example and this time drav/n tw'o boxes, the 
first containing a box and a star, the second, a star and a 
star, (Sam.c as page ii in boo);let) 

(L) "TURN THE PAGE, FIRST IS A PICTURE OF A BOX AND A 
STAR" (ooint to first picture) • "NEXT IS A PICTURE OF A ST7\R 
AND A War. put an *X* on the ONJj I SAY: BOX ^ ST7\R . " 

(P) Make sure they all do it correctly and understand 
the instructions. 
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5. (L) "TURT^ TO THE MEXT PAGE, NOW I7E ARE GOING TO LISTEN 

TO THE TAPE RECORDER. EVERYONE PLEASE BE VERY QUIET AND 
AND DO EX7\CTLY V-7HAT THE TAPE TELLS YOU TO DO." 



LISTEN, 



Turn on recorder, 

6, (P) As recorder plays, V7alk around air.ong the students, 
checking to see that they are on the right page, etc. If a 
page is missing, or there are tv70 the same, tell them not to 
V7orry, and help them go on to the next page, as the recorder 
gets to it, 

7, (P) If there are any disturbances, such that you see soirie 
of them didn't hear the recorder, stop the tape, back it up, 
and tell them you are going to play the last one over so that 
everyone can hear it. Make sure the room is as quiet as pos- 
sible at all times. If necessary, take out of the room anyrne 
who continues to disturb the rest, 

8, (P) At end of test, collect booklets, and mark on front 
of each booklet the teacher's name and the grade. 



PAIHED ITE'-IS (test formats 3 & ^0 



Sarnple it eras: 



Test: 



i. star /box 


gate/date 


ii , box/star 


star/star 


1, gate/gate 


gate/datc 


2, bat /back 


bat /bat 


3 • chop/ chop 


choo/shoo 


4 , tug/tub 


tub /tub 


5» cot /cot 


cot /pot 


6, gum/gun 


gun/gun 


7. sack/sack 


shack/ sack 


8, gas /gas 


gas /gash 


9. tool/tool 


pool/tool 


10, deer/deer 


deer /beer 


11,. seat /feet 


feet/feet 


12, top/cop 


cop/ cop 


13. vatchA-ratch 


wa.sh/watch 


l4, goat /goat 


goat /boat 


15. robe/road 


robe/robe 


l6, fire /fire 


fire/ file 


17. lock/lock 


lock/rock 


l8, ca-o/cat 


cat /cat 


19. lass /laugh 


laugh/laugh 


20, tool/oool 


pool/pool 


21, bug/bug 


bud/bug 


22, gum/gum 


gun/gum 


23, cake/caue 


cake/cake 


24, beer/deer 


beer /beer 


25. back/back 


back/bat 




# 



26 . 


nail/nai3. 


nail/nail 


27. 


lass /lass 


lass/lauf?h 


28. 


rock/lock 


rock/rock 


29. 


boat /<y oat 


boat /boat 


30. 


tug/tu^ 


tub /tug 


31. 


pot /cot 


pot /pot 


32. 


cape/cat)e 


cape/cake 


33. 


top/top 


COo/tOTD 
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